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INTRODUCTION 


It  has  long  been  hypothesized  that  genetic  variation  is  responsible  for  observed 
differences  in  cancer  risk  and  susceptibility  amongst  the  human  population.  Mutant  alleles 
of  dominant  highly  penetrant  breast  cancer  genes,  including  BRCAl  and  BRCA2  (1-3),  do 
not  occur  frequently,  and  hence  account  for  only  a  small  proportion  of  breast  cancer 
cases.  On  the  other  hand,  several  studies  have  suggested  an  association  between  low 
penetrant  alleles  and  breast  cancer  risk.  Although  the  contribution  of  low  penetrant  alleles 
to  the  individual  breast  cancer  risk  is  relatively  small,  they  can  contribute  to  a  large 
proportion  of  breast  cancer  cases  in  the  population  because  the  risk-conferring  alleles  of 
these  genes  are  common. 

Identification  and  cloning  of  low  penetrant  alleles  that  increase  the  risk  of  breast 
cancer  is  challenging  because  the  association  methods  for  such  studies  require  large 
populations  to  achieve  meaningful  statistical  analysis  and  very  dense  genetic  maps  to 
facilitate  genome-wide  genotyping  (4,5).  Although  microarray  technology  has  been 
developed  to  the  point  where  it  could  be  applied  for  parallel  analysis  of  genome-wide 
genot3q)ing  (6),  the  dense  genetic  maps  required  for  large  population  based  association 
studies  are  currently  being  constructed  for  fixture  genome-wide  applications  and  will  not 
be  available  for  several  years.  At  present,  the  candidate  gene  approach  remains  the  most 
logical  and  practical  strategy  to  identify  these  risk  enhancing,  low  penetrant  variants  or 
single  nucleotide  polymorphisms  (SNPs).  Until  now,  a  major  obstacle  with  investigating 
the  risk  associated  with  multiple  candidate  genes  has  been  a  lack  of  technology  for  large- 
scale  genotyping  of  large  populations.  Consequently,  many  studies  have  focused  efforts 
on  only  1  or  2  genetic  polymorphisms,  and  even  in  these  cases  the  analysis  was  only 
limited  to  relatively  small  sample  sizes.  In  the  context  of  the  ideas  program,  we  propose 
to  exploit  the  high  throughput  power  of  SNParrays  to  simultaneously  genotype  31 
different  genetic  polymorphisms  derived  from  26  genes  in  a  well  defined,  representative 
population -based  sample  containing  a  large  number  of  subjects.  We  have  selected  genetic 
polymorphisms  in  genes  involved  in  different  aspects  of  carcinogenesis  (7-41).  For 
example,  cell  cycle  regulatory  genes  such  as  CDK-inhibitors,  and  cyclins;  carcinogen 
metabolizing  enzymes  such  as  CYPs,  GSTs  and  NATs;  immune  system  genes  such  as 
interluekins  and  TNF;  and  genes  involved  in  other  pathways  involved  in  cancer  (e.g.  p53, 
PTEN,  XPD-DNA  repair  gene).  We  have  aceess  to  the  Ontario  Familial  Breast  Cancer 
Registry  (OFBCR),  which  is  the  largest  population  based  breast  cancer  registry  in 
Canada.  We  also  have  support  from  the  established  microarray  facilities  of  the  Ontario 
Cancer  Institute  and  Samuel  Lunenfeld  Research  Institute  in  Toronto. 

The  main  objective  of  the  proposed  work  is  to  identify  low  penetrant,  yet 
commonly  occurring,  genetic  polymorphisms  which  contribute  to  the  risk  of  developing 
breast  cancer.  Furthermore,  this  approach  has  the  potential  to  identify  novel  genetic 
factors  associated  with  breast  cancer  risk,  which  may  result  in  the  development  of 
innovative  therapies,  and  a  fuller  understanding  of  genetic  variation  in  response  to 
therapy.  The  establishment  of  the  proposed  approach  will  prepare  us  for  large-scale 
genotyping  involving  hundreds  or  even  thousands  of  candidate  genes  in  large  define 
populations.  This  will  lead  to  a  more  complex  analysis  of  gene-gene  and  gene- 
environment  interactions  than  is  currently  possible.  Advances  in  disease  etiology  will 
significantly  expand  our  abilities  to  design  strategies  for  the  prevention  of  breast  cancer 
development  and  progression. 
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STATEMENT  OF  WORK 


Task  1 :  Characterization  of  polymorphic  alleles  by  SSCP.  Months  1-8 

a.  Design  of  SSCP  primers  for  32  sites 

b.  Screen  by  SSCP  analysis  for  all  possible  alleles  at  each  locus 

c.  Sequence  the  SSCP  patterns  (appr  3  per  loci)  and  identify  the  all  possible  genotypes 

Task  2:  Designing  of  oligonucleotides  and  sample  microarravs.  Months  4-8 

a.  Design  different  sets  of  oligonucleotides  ( perfect  matches  and  mismatches) 

b.  Customize  sample  chips  for  quality  control  of  hybridizations 

Task  3:  Optimization  of  the  hybridizations  using  PCR  probes.  Months  8-16 

a.  Prepare  PCR  probes  using  control  specimens 

b.  Optimize  the  hybridization  conditions 

c.  Evaluate  the  accuracy  of  detection  for  every  polymorphic  site  using  a  probes  with 

different  allelic  combinations  for  each  polymorphism 

d.  Redesign  oligonucleotides  and  chips  in  order  to  increase  the  quality  and  accuracy  of 

detection 

Task  4:  Genotvping  of  900  specimens  for  32  polymorphisms.  Months  16-32 

a.  Production  of  microarray  chips 

b.  Preperation  of  flourescent  labelled  PCR  probes  for  each  patient 

c.  Hybridization  of  chips  at  optimized  conditions 

d.  Reading  and  analysis  of  the  chip  signals 

e.  Quality  control  experiments  at  different  intervals  using  the  control  specimens  to 
ensure  the  reproducibility  of  results 

Task  5 :  Data  and  statistical  analysis.  Months  32-36 

a.  Repeat  and  conformation  experiments 

b.  Complete  the  reading  of  every  slide  and  prepare  the  data  for  statistical  analysis 

c.  Univariate  analysis  of  the  data 

d.  Exploratory  multivariate  analysis  of  the  data 
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Design  and  Production  of  SNParrays  (Task  3  and  Task  4) 

Support  oligonucleotides  are  designed  to  be  printed  on  SNParrays  and  bind  allele 
specific  probes.  Each  support-oligonucleotide  contains  an  anti-TAG  sequence  which  is 
complementary  to  the  TAG-sequences  on  each  of  the  allele-specific  oligonucleotide 
sequences.  Each  anti-TAG-sequence  is  also  attached  to  a  15-mer  poly  (T)-tail.  The  anti- 
TAG  sequences  on  the  support-oligonucleotides  are  designed  to  hybridize  the 
fluorescently  labeled  allele-specific  oligonucleotides,  thus  attaching  them  to  the 
designated  spots  on  the  glass  slides.  The  poly  (T)-tails,  on  the  other  hand,  are  designed  to 
increase  the  efficiency  of  support-oligonucleotides  to  bind  to  the  glass  surface  during 
printing. 

Before  printing,  support-oligonucleotides  are  diluted  to  lOOpM  concentration  in 
3xSSC  buffer  and  placed  in  384  well  V-bottom  distribution  plates.  The  arrayer  is 
programmed  to  print  these  support-oligonucleotides  according  to  the  design  given  below 
(Figure  1).  Each  support-oligonucleotide  is  printed  in  duplicate  for  validation  purposes. 
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Figure  1:  Design  of  the  SNP  arrays.  Each  support  oligonucleotide  is  printed  in  duplicate. 
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The  SNParrays  are  printed  on  slides  with  poly-L-lysine  surface  chemistry  by  the 
microarray  facility  of  Samuel  Lunenfeld  Research  Institute  (SLRI)  of  Mount  Sinai 
Hospital  (MSH)  in  Toronto.  The  mechanism  of  poly-L-Lysine  slides  to  bind 
oligonucleotides  is  very  similar  to  the  super-amine  slides.  The  slides  are  covered  with  a 
positively  charged  chemical,  Lysine  in  this  case,  for  the  negatively  charged 
oligonucleotides  to  bind  where  the  poly  (T)-tail  enhances  the  strength  of  binding.  A 
Virtek»ChipWriterTM  Professional  (Pro)  arrayer  is  used  for  printing  spots  with  a  diameter 
of  120|J.M.  The  distance  between  the  centers  of  two  adjacent  spots  is  200|xM.  Each 
support  oligonucleotide  is  placed  in  two  different  array  locations  on  the  distribution  plates, 
and  two  pins  are  used  to  remove  oligonucleotides  from  these  two  locations  and  print 
duplicate  arrays  on  the  same  slide  simultaneously  (Figure  2). 

The  spots  on  the  slides  are  rehydrated  after  printing  process.  A  humidity  chamber 
is  filled  with  100ml  of  IxSSC  solution  and  the  slides  are  placed  in  the  chamber,  and  kept 
there  for  60  seconds  (array  side  is  facing  to  the  solution).  The  arrays  are  then  snap-dried 
for  a  few  seconds  on  a  lOO^C  hot  plate  (array  side  facing  up).  Rehydration  process 
increases  the  exposure  of  oligonucleotides  to  probes  during  hybridization.  After  printing, 
oligonucleotides  on  the  slide  are  fixed  in  a  UV  cross-linker  at  600mJ.  Unbound 
oligonucleotides  and  excess  salt  were  washed  off  the  slides  for  15  minutes  at  room 
temperature  in  a  solution  of  170mM  succinic  anhydride  (in  a  solution  of  1,2  methyl 
pyrollidinone  and  40mM  boric  acid).  This  step  neutralizes  the  positively  charged  surface 
of  the  slide  to  limit  the  hybridization  of  the  negatively  charged  probe  to  the  support- 
oligonucleotides  only,  and  not  to  the  slide  surface.  The  slides  are  then  rinsed  in  distilled 
water  for  one  minute,  and  then  kept  in  95%  ethanol  for  an  additional  minute.  They  are 
then  spin-dried  at  600  rpm  for  10  minutes. 

The  printing  quality  of  SNParrays  is  controlled  with  a  Cy3  labeled  control 
oligonucleotide  hybridization.  Synthesized  poly  (A)-tail  probes  are  attached  to  Cy3,  and 
hybridized  under  the  hybridization  conditions  described  below.  The  quality  of  every  new 
batch  of  printed  slides  was  validated  using  this  approach  (Figure  2). 
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Figure  2:  Quality  control  of  printed  oligonucleotides.  The  duplicate  SNParrays  are 
hybridized  with  cy3-labelled  poly  (A)  probes.  Homogenous  signal  is  detected  along  the 
array.  The  last  pair  of  control  oligonucleotides  (non-poly  (T))  did  not  give  any  signal. 


Optimization  of  Probe  Specificity  on  SNParrays  (Task  3) 

PCR  reactions  for  probe  preparation  purposes  were  optimized  taking  into 
consideration  different  variables  that  affected  the  yield  and  the  specificity  of  the  product. 
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Reactions  are  performed  in  a  total  volume  of  10|xl,  in  the  presence  of  10)a.M  of  each  cold 
dNTP,  a  range  of  2-4mM  MgCla,  5pmol  of  each  allele  specific  primer,  lOpmol  of  the 
common  reverse  primer,  5|a.M  of  fluorescently  (cy5)  labeled  dCTP,  0.25U  of  Platinum  Taq 
polymerase,  and  lOng  genomic  DNA.  Different  annealing  temperatures  are  tested  (ranging 
from  55-65°C),  depending  on  the  melting  temperatures  of  the  PCR  primers  used.  For  the 
quality  control  purposes,  previously  known  homozygote  and  heterozygote  templates  are 
used  to  prepare  the  probes  for  each  SNP.  Each  probe  is  hybridized  to  duplicate  arrays  to 
ensure  that  the  genotypes  were  detected  correctly.  This  procedure  is  repeated  for  all  the 
SNPs  in  the  study. 

Initial  quality  control  experiments  have  revealed  that  only  10  of  31  SNPs  could  be 
detected  specifically  for  the  combination  of  three  allelic  statuses.  An  example  of  an 
unspecific  SNP  detection  is  given  for  IFA13  SNP  in  Figure  3.  Experiments  for  the 
unspecific  SNPs  were  repeated  using  a  wider  range  of  MgCl2  and  annealing  temperature 
conditions.  With  this  we  were  able  to  increase  the  specificity  and  recover  five  additional 
SNPs.  New  set  of  allele-specific  primers  were  designed  for  the  remaining  unsuccessful 
SNPs,  and  the  optimizations  were  carried  out  as  described  above.  We  were  able  to  obtain 
specific  detection  from  an  additional  of  four  SNPs.  As  a  conclusion,  we  were  able  to 
obtain  high  specificity  from  19  out  of  31  SNPs  in  the  study.  See  Table  1  for  the  working 
conditions  of  these  SNPs.  The  unspecific  hybridization  of  the  probes  was  mainly  resulted 
from  incompatibility  between  the  nature  of  the  sequences  around  the  corresponding  SNPs 
and  the  experimental  design  for  probe  preparation  for  SNP  arrays.  We  have  developed  an 
alternative  complementary  method  to  screen  for  these  SNPs  successfully  (see  TaqMan 
method  below). 

Multiplex  Hybridization  and  Signal  Detection  (Task  3  and  4) 

The  PCR  reactions  (probes)  are  prepared  for  each  SNP  using  96-well  microplates 
and  kept  frozen  until  all  the  SNPs  for  an  individual  DNA  is  completed.  After  that,  5|il  of 
each  PCR  was  pooled  in  another  plate  with  an  8-channel  micro-pipette,  and  mixed.  From 
the  pooled  probe  mixture,  15|il  was  removed  and  mixed  with  5  pi  of  a  hybridization 
mixture  (1.33xSSC,  0.067%  SDS,  0.033mg/ml  of  salmon  sperm  DNA).  This  mixture  is 
placed  over  the  SNParray  which  is  then  covered  with  a  cover-slip.  The  slides  are  incubated 
for  3  hrs  at  50®C,  after  which  they  are  washed  for  1 5  minutes  in  a  solution  of  2xSSC  and 
0.1%  SDS.  SDS  is  rinsed  off  with  two  additional  consecutive  washings  in  2xSSC,  15 
minute  each.  Slides  were  dried  in  a  centrifuge  at  600  rpm  for  1 0  minutes.  Slides  are  kept 
in  a  dark  and  dry  place  to  eliminate  the  fluorescence  bleaching  due  to  light  exposure. 

Detection  of  the  intensity  of  fluorescent  signal  after  hybridization  is  performed  in  a 
GenePix  4000B  slide  scanner  (Axon).  The  scan  results  are  then  analyzed  with  a  Genepix 
Pro  4.0  analysis  software.  The  software  uses  a  feature  indicator  which  sits  on  the 
SNParray.  Each  feature  in  this  block  defines  the  borders  of  individual  oligonucleotide 
spots.  Anything  outside  the  border  (until  the  next  spot)  is  considered  as  background,  and 
anything  inside  the  border  is  considered  as  gross  signal  intensity.  By  creating  a  special 
type  of  tab  delimited  text  file  (.gal),  oligonucleotide  names  can  be  defined  to  the 
individual  features.  This  file  is  created  with  the  names  of  oligonucleotides,  and  their 
specific  locations  are  defined  in  the  SNParray.  A  results  table  is  created  by  the  software, 
including  the  medians  and  means  of  the  gross  signal  intensity,  background  signal  intensity 
and  the  net  signal  intensity.  This  table  is  saved  as  a  tab  delimited  text  file  (.gpr  extension), 
which  can  be  opened  and  examined  in  a  Microsoft  Excel  worksheet  format.  The  means  of 
net  signal  intensity  (after  subtracting  the  background  from  the  gross  signal)  is  used  for 
determination  of  the  genotypes.  An  example  of  raw  (Table  2)  and  processed  data  (after 
subtraction  of  background  values)  (Figure  4)  are  given. 
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Figure  3:  Measuring  the  specificity  and  the  quality  of  each  SNP  specific  probe  to  detect 
allelic  status.  IFA13  SNP  was  studied  using  probes  prepared  from  homozygote  (CC  and 
TT)  and  heterozygote  (CT)  templates.  As  seen  in  the  figure  the  genotype  detection  was  not 
specific  to  the  allelic  status. 
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Table  1;  PCR  (probe  preparation)  conditions  for  19  SNPs  detection  of  which  were  specific 
on  SNParrays. 
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Table  2:  A  section  of  raw  data  table  created  by  Genepix  Pro  4.0  analysis  software.  X  and 
Y  show  the  specific  location  of  the  oligo  spot  on  the  slide.  F635:  gross  feature  (oligo  spot) 
intensity  at  635  nm  wavelength.  B635:  background  intensity  at  635  nm  wavelength. 
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Figure  4:  Output  of  a  SNParray  hybridization.  The  signal  intensities  (left  table)  were 
obtained  after  processing  of  the  raw  data  taking  into  consideration  the  background 
intensities.  Genotypes  (right  table)  were  obtained  on  the  basis  of  signal  ratios. 
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Development  of  SNParray  Software  (Task  4) 

We  have  designed  a  SNParray  software  to  organize  and  evaluate  the  large  amount 
of  raw  data  (Table  2)  obtained  fi-om  SNParray  scans.  Manual  analysis  and  evaluation  of 
the  raw  data  would  have  been  very  inefficient  considering  that  approximately  1000  seans 
are  proposed  to  be  performed  throughout  the  project.  This  SNParray  software  was 
designed  to  convert  the  net  signal  read  by  the  analysis  software,  to  different  genotypes  for 
each  SNP.  An  algorithm  was  prepared  first,  and  the  software  was  written  by  the  institute’s 
microarray  facility  in  PERL  (a  C  based  computer  language). 

The  efficiency  of  each  oligonucleotide,  and  therefore  their  signal  intensity  varies, 
providing  a  range  of  fluorescent  signal  intensities.  A  letter  coding  is  assigned  to  different 
level  of  intensity  values  to  distinguish  the  yield  and  the  quality  of  the  signal.  Signals  under 
certain  values  are  considered  as  background.  Software  is  able  to  read  the  signal- 
background  values  from  the  raw  data  table,  and  label  each  signal.  Different  genotypes  for 
each  SNP  are  defined  for  every  possible  letter  combination  in  the  algorithm,  considering 
the  defined  cut-off  values  of  all  oligonucleotides.  Each  of  the  three  genotype  categories  is 
subdivided  into  three  perfection  levels  (excellent,  good  and  poor)  considering  both  the  net 
signal  intensity  and  the  relative  intensities  of  two  allele  specific  oligonucleotide  of  the 
same  SNP.  The  results  that  fall  into  the  poor  category  will  be  repeated  using  a 
complementary  TaqMan  method  for  validation  purposes  (see  below). 

Method  Validation  (Task  4) 

The  SNParray  method  was  validated  for  three  selected  SNPs  using  two  additional 
methods  that  are  routinely  used  in  our  laboratory.  MTHFR  677C>T  and  MnSOD  Val- 
9Ala  polymorphims  were  validated  using  single  strand  conformation  analysis  (SSCP), 
and  ER  SerlOSer  SNP  is  validated  using  TaqMan,  5’  nuclease  allelic  discrimination 
assay.  The  discrepancy  rate  for  MnSOD  SNP  was  a  high  with  a  frequency  of  about  1 0%. 
This  was  expected  since  the  hybridization  signal  intensity  of  this  specific  SNP  was  very 
weak  on  the  SNParray.  The  results  of  MTHFR  and  ER  SNPs  were  highly  concordant,  with 
a  discrepancy  rate  of  less  than  1%.  The  validation  results  for  these  three  SNPs  are 
provided  in  Figure  5. 


(a)  ER  SerlOSer  polymorphism 
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(b)  VDR  MetlThr  polymorphism 
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(c)  MnSOD  Val-9Ala  polymorphism 
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Figure  5:  Validation  of  genotypes  of  three  SNPs  (ER  SerlOSer,  VDR  Metl  Thr  and 
MnSOD  Val-9Ala)  using  SNParrays,  SSSC  and  TaqMan  methods. 


Complementary  Screening  using  TaqMan  Method  (Task  3  and  Task  4) 

In  order  to  genotype  cases  and  controls  for  the  SNPs  that  could  not  be  studied 
using  SNParrays  we  have  developed  a  complementary  TaqMan  method.  This  method  uses 
the  Perkin-Elmer  (PE)  Applied  Biosystems  Sequence  Detection  7900  HT  System.  This 
PCR  based  detection  method  uses  allele-specific  fluorescent  probes,  with  a  different  label 
for  each  allele,  to  discriminate  between  alleles.  Probes  anneal  in  a  sequence-specific  (i.e. 
allele-specific)  manner  between  the  PCR  primers,  and  in  the  course  of  the  PCR  the  5'- 
nuclease  activity  of  the  Taq  polymerase  releases  the  reporter  dye  of  bound  probes  only, 
emitting  an  allele-specific  fluorescence.  The  reporter  fluorescent  signal  of  probes  is 
subdued  by  a  quencher  molecule  in  the  intact  probe,  and  does  not  release  a  signal.  This 
methodology  has  the  advantages  of  avoiding  the  use  of  restriction  digests,  hybridizations 
or  electrophoresis  thereby  avoiding  many  sources  of  error  and  allowing  high-throughput 
genotyping. 

Using  this  approach  we  will  have  been  optimizing  the  primers  and  probes  using  the 
Applied  Biosystems-approved  Primer  Express™  probe  and  primer  design  program.  The 
PCR  eonditions  are  applied  according  to  the  manufacturers  protocol.  Genotype  analysis 
and  calling  is  performed  on  amplified  samples  using  the  7900HT  software,  using  the 
standard  proeedures  for  automated  allelic  discrimination.  In  brief,  by  eomparison  to  the 
fluoreseence  signals  in  known  controls  (homozygote  allele  1 ,  homozygote  allele  2,  and  no 
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template),  the  software  will  call  each  “unknown”  sample  as  homozygote  allele  1, 
heterozygote,  homozygote  allele  2,  undetermined,  or  no  amplification  (Figure  6).  Outliers 
will  be  excluded  as  undetermined,  and  genotyping  of  these  samples  will  be  repeated. 

We  have  currently  optimized  the  conditions  for  six  and  completed  the  screening  of 
three  SNPs  using  this  approach. 


Estrogen  Receptor  (C270Tatcdio) 

TaqMan  Analysis,  96  well  plates 


-O.IO  0.100  0  300  0.5  0.090  0.899  1.100  1.299 

AlleteX(MGB21  c) 


Figure  6:  Output  from  Taqman  assay.  Genotypes  (CC,  XT  and  CT)  are  accumulated  at 
different  coordinates  due  to  their  signal  intensity.  This  is  done  automatically  by  TaqMan 
software. 


Plating  of  Subject  DNA  Samples  (Task  3  and  Task  4) 

DNA  samples  of  breast  cancer  cases  and  population  controls  are  organized  into  10 
lists,  each  representing  a  96-well  micro-plate  format.  DNA  samples  are  diluted  to  5  ng/pl 
concentration,  and  transferred  into  96-well  master  plates.  Using  an  electronic  8-channel 
micropipette  dispenser,  numerous  replica  plates  are  prepared  each  well  containing  a  total 
of  5ng  of  genomic  DNA.  Each  replica  plate  is  labeled,  covered  individually  after  all  the 
DNA  is  air-dried,  and  stored  at  room  temperature  until  they  are  used.  An  example  for  a 
plate  organization  is  given  in  Figure  7.  Each  plate  is  prepared  to  contain  three  types  of 
controls: 

(a)  Negative  template  controls;  four  negative  DNA  control  samples  are  prepared  in  the 
same  locations  on  each  plate,  controlling  any  possible  DNA  contamination. 


14 


(b)  Experimental  controls;  12  cell-line  DNA  samples  (previously  sequenced  for  all  the 
listed  SNPs)  are  located  in  the  same  coordinates  in  each  plate.  The  results  from 
these  cell  lines  are  used  to  evaluate  the  quality  of  probe  preparation  and 
hybridization. 

(c)  Validation  controls;  10  DNA  samples  from  each  list  are  repeated  in  the  following 
plate  to  evaluate  the  reproducibility  of  the  results  and  to  prevent  DNA  sample  mix- 
up  during  master  plate  and/or  replica  plate  preparations. 
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Figure  7:  Design  of  subject  DNAs  and  controls  on  the  96-well  microplate. 


Screening  of  Breast  Cancer  Cases  and  Population  Controls  (Task  4) 

Currently  we  are  screening  cases  and  controls  for  19  SNPs  using  the  SNParray 
strategy  described  above.  We  are  about  to  complete  the  results  of  the  four  out  of  total  1 0 
plates.  We  are  also  currently  optimizing  the  TaqMan  assay  conditions  for  the  remaining 
SNPs  which  were  unsuccessful  with  SNP  arrays.  Currently  we  have  completed  the 
screening  of  three  out  of  20  SNPs  using  TaqMan  method. 

Immediate  Future  Task 

Using  the  SNP  arrays  and  TaqMan  method  we  will  complete  the  screening  of  all 
the  SNPs.  Quality  control  and  validation  studies  will  be  carried  out  throughout  the 
screening  to  assess  the  reproducibility  and  accuracy.  The  results  will  be  statistically 
analyzed  to  determine  the  association  of  SNPs  with  breast  cancer  risk. 

Key  Research  Accomplishments 

We  have  accomplished  the  tasks  proposed  in  the  Statement  of  Work  by 

■  designing  SNParray  format  and  moving  forward  with  their  mass  production 

■  optimizing  the  specificity  of  each  probe  individually  on  SNP  arrays 

■  developing  conditions  for  multiplex  hybridization 

■  developing  complementary  genotyping  method  (TaqMan)  for  screening  SNPs  that 
were  unspecific  in  with  the  SNParray  protocol 

■  developing  a  complicated  software  to  organize  and  evaluate  the  raw  data  obtained 
from  SNParray  scans 

■  Validating  the  SNParray  method  with  two  other  genotyping  methods 

■  preparing  and  plating  all  the  subject  DNA  samples  to  be  used  in  the  study 

■  screening  a  portion  of  the  study  cases  and  controls 


Reportable  Outcomes 

This  work  has  been  presented  at  following  conferences. 

1 .  Venus  Onay,  Julia  Knight,  and  Hilmi  Ozcelik,  "Candidate  SNP  Analysis  in  the  Study  of 
Breast  Cancer  Risk  Using  SNParrays."  Controversies  in  the  Etiology,  Detection  and 
Treatment  of  Breast  Cancer:2002,  June  13-14,2002,  Toronto,  Ontario,  Canada. 

2.  Venus  Onay,  Julia  Knight,  and  Hilmi  Ozcelik,  "Candidate  SNP  Analysis  in  the  Study  of 
Breast  Cancer  Risk  Using  SNParrays."  93*^^  Annual  Meeting  of  AACR,  April  6-10,2002, 
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3.  Venus  Onay,  Julia  Knight,  and  Hilmi  Ozcelik,  "Identifying  the  Role  of  SNPs  in  Breast 
Cancer  Risk  Using  Microarray  Technology."  Oncogenomics  Conference,  25-27  January 
2001,  Tucson,  Arizona,  USA. 

4.  Venus  Onay,  Julia  Knight,  and  Hilmi  Ozcelik,  "Microarray  Technology  to  Study  the 
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